CDS

Accession Number TCMCG074C30762
gbkey CDS
Protein Id KAF8412369.1
Location join(6301895..6301897,6302057..6302186,6302266..6302408,6303366..6303452,6304261..6304335)
Organism Tetracentron sinense
locus_tag HHK36_000333

Protein

Length 145aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA625382, BioSample:SAMN14615867
db_source JABCRI010000001.1
Definition hypothetical protein HHK36_000333 [Tetracentron sinense]
Locus_tag HHK36_000333

EGGNOG-MAPPER Annotation

COG_category O
Description The proteasome is a multicatalytic proteinase complex which is characterized by its ability to cleave peptides with Arg, Phe, Tyr, Leu, and Glu adjacent to the leaving group at neutral or slightly basic pH
KEGG_TC -
KEGG_Module M00337        [VIEW IN KEGG]
M00340        [VIEW IN KEGG]
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01002        [VIEW IN KEGG]
ko03051        [VIEW IN KEGG]
KEGG_ko ko:K02732        [VIEW IN KEGG]
EC 3.4.25.1        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko03050        [VIEW IN KEGG]
map03050        [VIEW IN KEGG]
GOs GO:0000502        [VIEW IN EMBL-EBI]
GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005839        [VIEW IN EMBL-EBI]
GO:0019774        [VIEW IN EMBL-EBI]
GO:0032991        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:1902494        [VIEW IN EMBL-EBI]
GO:1905368        [VIEW IN EMBL-EBI]
GO:1905369        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGTCTATCAGCATCAACACAACAAGCAGATGAGCTGTCCTGCAATGGCTCAACTCCTCTCCAACACTCTTTACTACAAACGTTTCTTCCCCTATTATTCTTTCAACGTATTAGGTGGCCTCGATAATGAAGGAAAGGGTTGTGTGTTCACATACGATGCAGTCGGATCCTATGAGAAGGTTGGATACAGCTCCCAAGGTTCTGGTTCTACGCTCGTCATGCCCTTTTTGGACAACCAACTGAAGTCTCCGAGCCCTCTCTTATTACCTGCCCAGGATGCCGTGACTCCACTTTCCGAATCAGAAGCAATTGACTTGGTAAAAGTTGTTTTTGCATCTGCAACTGAAAGGGATATATACACTGGAGACAAGCTGGAAATAGTCATCTTAAACGCTGATGGTATTCGGCATGAATATATGGATCTCAGGAAAGATTGA
Protein:  
MVYQHQHNKQMSCPAMAQLLSNTLYYKRFFPYYSFNVLGGLDNEGKGCVFTYDAVGSYEKVGYSSQGSGSTLVMPFLDNQLKSPSPLLLPAQDAVTPLSESEAIDLVKVVFASATERDIYTGDKLEIVILNADGIRHEYMDLRKD